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Method and apparatus for Image processing 
This invention relates to three-dimensional image generation 
using a variety of types of interrogating radiation, including 
5 electromagnetic, particle and acoustic radiation. In 
particular; it finds application in the processing of images 
produced with X-rays. 

In conventional X-ray tomography the apparatus produces 
images of a single slice of an object by moving an X-ray camera 
10 and the image plate in a complementary manner so that everything 
is blurred except for a predetermined plane. Commercial 
machines produce images of a number of separate slices, moving 
the X-ray machine to produce a planar picture and then moving 
the object under test in the Z direction to produce a planar 
15 picture at different slices. That needs more total pictures 
than the views spread over a solid angle and it produces a 
cruder representation of a three-dimensional image because 
volume cells intermediate between slices have to be derived by 
interpolation. 

20 Conventional tomography thus produces images of discrete 
slices normal to the axis of objects of short axial length. It 
is more common, however, to require images to discrete slices 
normal to the axis of a long object. The established form of 
computer tomography <CT) is designed to do this. It does it by 

25 taking a number line images, at different viewing angles in the 
plane of each such slice (typically 360-1440 views, taken at 1° 
to H° intervals). These are then used to compute the 
corresponding slice image. However, the invention - and even 
the established form of computer tomography - Is better 

30 understood by first postulating an extension of computer 
tomography which, unlike existing systems, would produce a true 
three-dimensional mapping of the X-ray densities (i.e. 
opacities) of a complete set of symmetrical, equally-spaced 
volume cells (voxels), in place of merely producing such 

35 information for each of a limited number of parallel slices 



WO 93/09512 



PCT/GB92/01969 



- 2 - 

through the relevant object. This hypothetical version of 
computer tomography might map a cubical volume L 3 into (L/c) 3 
voxels of size c 3 . In its simplest form, this postulated ideal 
form of computer tomography would involve L/c distinct X-ray 
5 projections, of (L/c) 2 pixels. Each pixel represents the 
aggregate density of a column of L/c voxels. The individual 
densities of all voxels are then computed by (L/c) 3 simultaneous 
equations. No assumptions about the features of the specimen 
are involved- The resulting picture is fully three-dimensional, 
10 amenable to three-axis rotation, slicing etc. 

In practice, the voxels are not independent:- signal/noise 
considerations require a significant degree of smoothing and 
integration (by the human eye or brain and/or by a computer 
algorithm)* which causes the effective Image to be composed of 
15 reasonably smooth -multi -voxel elements. 

We have found that It 1s possible to make .use of this 
assumption to simplify data extraction, storage and processing. 

The processing may be simplified by treating an object under 
test as a set of component objects, each defined by one or more 
20 closed surfaces, where the precision of the surface shape of 
these component objects should ideally match the resolution of 
the sensor system. Indeed, for smoothly-varying surfaces, a 
least mean-squares or similar fit can produce a precision 
superior to that of the individual constituent measurements. 
25 Furthermore these component objects are generally of 
individually uniform specific density, and of reasonably 
"well-behaved" shape. (Complex shapes may be broken down into 
multiple constituent objects of simpler shape.) 

These component objects can be cross-identified, between 
30 different views, by projection on to the axis common to each 
pair of such views. Each such projection also defines the 
three-dimensional position of two outermost surface points of 
the object (as projected on to this common axis), with a 
precision matching that of the source pictures. Thus n views 
35 generate n(n-l) such points on each object surface, and these 
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points cluster most closely In the areas of greatest curvature 
on these surfaces. The line contour segments between the points 
can also be derived with considerable precision. This generates 
a network of n(n-l)+2 surface cells. The three-dimensional shape 
5 of these cell surfaces can also be derived with good precision. 
(If there are any inflexions or sharp corners or other point 
singularities, their position in three dimensions can similarly 
be identified from any two suitable projections, thus producing 
some additional nodes and meshes in this network.) 

10 On this basis, about ten views, yielding 92 cells per 
surface, are likely to be to be sufficient for most 
applications. At the other extreme, 32 views (still a minute 
number compared with conventional computer tomography) would 
generate 994 cells. This permits the use of much simpler 

15 algorithms to generate the network of line-segments linking the 
points and to derive the surface of the resulting mesh cells. 
We have found, in practice, tolerable reconstructions can be 
obtained with as few as five source images, whilst anyone 
wishing to ignore the curvature of line segments might wish to 

20 use up to 100 source views. 

The representation of the observation volume as a set of 
three-dimensional objects matches the physical reality. This 
enables the technique to cope with higher noise levels than 
equivalent conventional systems. It also provides a convenient 

25 basis for determining specif 1c densities and for providing 
user-friendly facilities of object selection, enhancement, 
rotation, slicing, and general manipulation. 

It is possible to regard a specimen as composed of a limited 
number of solid "objects", which each: 

30 1. Comprise a substantial number of voxels; 

2. Have a reasonably "well-behaved" geometric shape; 

3. Have a reasonably smoothly-varying projected density. 
When applying the invention to X-rays, used for medical 
applications, a significant proportion of clinical situations 

35 match this scenario. A single functional physiological entity 
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of more complex shape will then normally comprise two or more 
"objects" of this type. Diagnosable anomalies are departures 
from the expected size, shape and density of these objects or 
the presence of "unscheduled" extra objects. The requisite 
5 three-dimensional information can be extracted from, say, about 
ten X-ray projections from different directions. 

The resultant picture is amenable to three-axis rotation and 
to slicing, just as with three dimensional pictures extrapolated 
from conventional forms of tomography. In addition, however, 

10 any desired subset of objects can be readily selected or deleted 
(or enhanced, attenuated or coloured) at the user's discretion. 
Thus this approach gives the benefits of conventional forms of 
tomography - and some further user-friendly features - with 
dramatically fewer X-ray exposures and reduced computation. 

15 According to the present invention there is provided a 
method of processing images of a specimen containing at least 
two regions of different opacity to the passage of radiation 
comprising irradiating said specimen with radiation from a 
source, detecting said radiation with a detector sensitive to 

20 said radiation to produce a plurality of signals Indicative of 
the amplitude of said radiation passing through said specimen in 
a corresponding plurality of different directions, processing 
said plurality of signals to derive further signals Indicative 
of contours of maximum local discontinuity in opacity or texture 

25 thereby providing an indication the position of the 
three-dimensional boundary surface between said two regions. 

Preferably said further signals are subject to further 
processing to subdivide the representation of said surface into 
a plurality of elementary substantially planar surfaces to 

30 facilitate the manipulation of images of said boundary surface. 

The invention will now be particularly described by way of 
example with reference to the accompanying drawings, 1n which: 
Figures la and lb show obscuration of objects; 
Figure 2 Illustrates conventional tomography 
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Figures 3 and 5 show arrangements of computer tomography 

and Its extension with linear and array 
detectors respectively; 

Figure 6 is an Illustration showing two views of a 

5 complex shape; 

Figure 7 shows a three-dimensional scanning 

arrangement; 

Figures 8 to 28 are diagrams used to explain the method 

1n accordance with embodiments of the 
10 Invention; and. 

Figures 29 to 36 are computed reconstructions of a bone. 
With conventional single X-rays. e.g. for orthopaedic 
medical applications or airport luggage screening, each picture 
element (pixel) represents the aggregate attenuation due to the 
15 X-ray densities of all the volume cells (voxels) traversed by 
the corresponding ray. Such pictures, valuable as they are, 
suffer from a number of significant limitations:- 

* There 1s no Indication of absolute or even relative 
depth; 

20 * An opaque object 1 3 , anywhere i n the ray path , 
obscures any other object 11 in that path, be it 
nearer or further than the opaque object. (Figure 1); 

* the single two-dimensional view may be little help for 
visualising the three-dimensional shape of objects. 

25 * the picture is depth-dependent and hence distorted see 
(Figure 1):- 

Let the photographic plate (or equivalent sensor array) be 
the X,Y plane 15, with Z=0. A point source at X=0, Y=0, Z«=R 
will then map any object point x,y,z on to the Image point 
30 x R/(R-z), y R/(R-z) 

It is therefore normal practice to take two or three views 
at radically different angles. However, even then, some quite 
demanding mental gymnastics may be required to resolve the 
Images into a set of objects, deduce their position in 
35 three-dimensions, and visualise their three-dimensional shape. 
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Certainly the resulting information is less complete or precise 
than desirable for planning detailed surgical procedures, and so 
a patient may have to be cut up more drastically than should be 
necessary. It is certainly less than ideal for focusing the 
5 appropriate damaging radiation on to tumours, kidney stones 
etc- The information is totally Inadequate for the direct 
control of surgical Instruments, or for defining the precise 
size and shape of dental or orthopaedic prosthetics, and 
controlling their manufacture. Hence the provision of 

10 prosthetics may involve separate surgical operations for several 
consecutive trial fittings, before an acceptable device is can 
be permanently implanted. 

Since the surgeon is trained to visualise small 
three-dimensional objects from a stack of suitably spaced 

15 micrototome slices, viewed in his microscope, conventional 
tomography seeks to provide similar information from his X-ray 
equipment. - ("Tomography" has the same etymological root as 
"microtome 11 .) In this technique, the point-source of radiation 
is above the target object, and the photographic plate for 

20 equivalent sensor array) is below it and parallel to the desired 
slice. The source and sensor are then progressively displaced, 
in opposite in directions, (normally in one, but Ideally In two 
dimensions) parallel to the slice plane (Figure 2). Provided 
these displacements are proportional to the respective spacings 

25 from the chosen slice plane, the ray passing through any given 
object voxel in the selected slice plane will always terminate 
at the same image pixel, producing an enlarged but undistorted 
picture of the selected si tee. However, at all other levels In 
the target object, the ray sweeps through a two-dimensional 

30 array of distinct voxels, thus smearing their contributions to 
this pixel into a substantially uniform grey background to the 
picture, which can be largely eliminated by spatial low-pass 
filtering. However, the contrast so achieved 1s somewhat 
limited. 
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Conventional tomography 1s well suited to producing views in 
planes parallel to the major axis of long thin object, see A-A 
in Fig. 1. However, this is precisely the information which can 
best be visualised from a single conventional X-ray. Hence the 
5 user is far more interested In slice views such as B-B. 
Computer Tomography (CT) meets this demand. Computer Tomography 
uses a line array of sensors, in the plane of the desired slice, 
and a point source, facing it on the opposite side of the object 
(F1g. 4). Both rotate through equal and opposite angles through 

10 360°, so that views through that slice can be taken, typically 
at intervals of 0.25°. (In principle, 180° rotation would be 
sufficient but, with divergent sources, reciprocal views are not 
identical, and continuous rotation, possibly in a spiral scan, 
is mechanically more convenient.) In each view, each ray 

15 generates, in the corresponding sensor pixel, a signal which 
defines the aggregate attenuation due to all the voxels (within 
that slice) traversed by the ray. 

As indicated in Table 1, a 1,000-polnt sensor, with a 
1,000-point resolution along each ray, can then produce a 

20 1,000-line (10 6 pixel) image. However, this requires 1,000 
independent exposures to collect the necessary data. Assume we 
need to distinguish ten discrete voxel density values, with an 
average value of 5. Each image pixel then gives the sum of 1000 
such voxel densities. Hence we need to define this aggregate 

25 density of around 5000 with an accuracy of ±0.5, i.e. a 
signal/noise ratio of 80dB. Each slice undergoes 1000 such 
irradiations, thus exposing the patient to a total radiation 
dosage which is at the limit of permissible risk, acceptable 
only in case of critical medical need for the information. 

30 Furthermore, all the irradiation, for all the slices, has to 
take place in very rapid succession:- The patient has to keep 
quite still and hold his breath, to minimise movement until the 
records from all the slices are completed. This entails an 
intensity and continuity of X-ray generation which is near the 

35 limit of tube technology, and can be achieved only by aiming the 
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X-ray beam at a source spot at the edge of a rapidly-rotating 
metal targets, to prevent melting. Finally, each slice requires 
the determination of one million unknown voxel densities from 
1000 simultaneous equations of 1000 terms each, normally using a 
5 specialised array processor. 

Computer tomography produces very high-quality 
two-dimensional Images, with a resolution, typically, of 1000 
lines. The third dimension, however, depends on the 
interpolation between slices. Normally 30 or 40 sTIces are 
10 taken; very exceptionally this might be Increased up to 100. 
However, the inter-slice" spacing can be adjusted so that the set 
of /slices just brackets the region of Interest. 

In some Instances, the coputer tomography source beam has 
been broadened somewhat In the axial dimension, normal to the 
IS slice, and the sensor array has accordingly been widened from a 
single-line array to .an elongated rectangle, so as to yield 
full, uniform three-dimensional resolution within slices of 
moderate width, and so - via multiple slices - in the whole 
volume of interest. This, of course. Increases the 
20 computational load and data- storage requirement even further. 

Information defining the full three-dimensional volume could 
be found by a symmetric extension and generalisation of the 
principle, employed in the generation of a single 
two-dimensional computer tomography slice, as Illustrated in 
25 Figure 5. However, this scheme would employ a full-width 
two-dimensional detector array. In place of deriving a 
two-dimensional slice distribution from a number of 
one-dimensional recordings spread over 180° (or 360°), it 
derives the full three-dimensional volumetric density 
30 distribution from a number of two-diens1onal recordings spread 
over a solid angle of 2ir Cor 4ir) Table I (analogous to Table 1) 
shows that 1 ,000 exposures, each of 1,000 by 1,000 pixels, would 
be required to map the independent densities of a full set of 
1,000 by 1,000 by 1,000 (I.e. 10 9 ) voxels. The spherical 
35 symmetry of data acquisition, In this scheme, optimally matches 
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the symmetry of three-dimensional space. In other respects, 
however, this scheme 1s similar to the "fan-beam tomography 11 of 
the preceding paragraph. 

The schemes outlined above all acquire, process and analyse 
5 data as if the user were interested 1n a set of 10 9 independent 
voxel densities. In practice, of course, the user could not 
Interpret such single cells, and he 1s only interested in 
discrete physiological entitles and their characteristics. It 
is desirable to look for discrete Objects 1 of relatively 
10 'well-behaved 1 shape, each containing substantial number of 
voxels of similar specific density. The aim is:-. 

* To provide high precision in three-dimensions; 

* To match the acquisition and representation of the 
data more nearly to their nature and use, thus 

15 simplifying acquisition and making displays more 

responsive to the user's wishes and needs; 

* To condense and simplify the data and to permit more 
efficient data and image manipulation; 

* To reduce the number of separate irradiations of each 
20 object voxel - roughly 100-fold; 

* To reduce the intensity of each individual constituent 
exposure - at least 100- fold; 

* Thus to make the aggregate exposure small enough for 
wide use, whenever helpful to the diagnostician or 

25 surgeon. 

An object-based scenario is excellently matched to 
non-medical applications, such as Industrial inspection, and 
security inspection at airports etc. It is also an adequate 
representation of a high proportion of medical applications, 

30 particularly orthopaedic ones. This obiect-nature of our 

subject material is the basic assumption underlying our 
approach . In fact, we do not depend on the source material 
being very simple in shape:- Any pronounced change or of 
projected density - or of texture - will be treated as an 

35 interface to another "object". The scheme can also accommodate 



WO 93/09512 



PCT/GB92/01969 



- 10 ~ 

objects of spatially varying density, or indeed some totally 
non-object-like material. 

Let us consider what is special in object-based images and 
how can we benefit from it. An object is any region in space 
5 enclosed by a discontinuity in density or texture. Each 
projected view of an object generates the corresponding outline, 
in. the image plane. Any pair of such image planes intersect on 
a common axis. Hence we can identify, on both views, the 
extremities of the object as projected on to this axis. (Figure 
10 6). (These points correspond to the Intersection of the two 
outline loci on the real object.) Thus we obtain two "common 
points' 61-63 whose two-dimensional position is known in two 
distinct planes, and so we define their position in three 
dimensions. 

15 The incident radiation is divergent and hence the pictures 
produced are generally both enlarged and distorted. However, 
each point on either view will define precisely the direction of 
the ray linking it to the corresponding point source. Hence we 
displace both points Incrementally, on either side of their 

20 initial position, until we find the pair of positions for which 
the corresponding rays intersect (or, at worst, their minimum 
separation is least) This intersection - exact or interpolated, 
is then the required three-dimensional location. A simple 
algorithm of progressively smaller incremental steps can 'home 

25 in 1 on to this intersection (or quasi-intersection) in a small 
number of iterative attempts. If the shape is re-entrant, there 
will normally be two additional local 'extremities' in both 
views, thus providing the three-dimensional location of two 
further points. If there are any sharp corners or other 

30 singularities these, too, can be identified. (Figure 6.) 
Indeed, the more complex the shape, the greater is the number of 
points required, but the greater is also the number of points 
generated. (Any sharply discontinuous attachment or hollow 
would generate a discontinuity, in the projected cumulative 

35 density, and so would be created as a separate object.) To set 
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a lower limit, let us assume that we are dealing with an object 
of simple shape, with no deep dents or awkward projections and 
no sharp corners to provide further identifiable points. Thus, 
for each pairing of two views, just two common points can then 
5 be defined in three-dimensions, and so n views generate n(n-l) 
pairings. (Table 3) We have found that approximately ten views 
will generally be sufficient (with 30 views as an extreme upper 
limit). 

The point of Intersections of two locus lines Is subject to 

10 a tolerance proportional to the cosecant of their subtended 
angle. With ten views the most closely spaced views differ by 
45°, which should cause no problems. However, with 30 views the 
most closely spaced views differ by only 25°. If we therefore 
preclude each view from pairing with four near neighbours, this 

15 still leaves 30 x 25/2 view pairings, a minimum of 750 points.) 

The viewing axis of the source 43 and diametrically opposite 
two-dimensional sensor 41, can be steered 1n one angular 
dimension by rotating the inner arc 71 of Figure 7 about a 
horizontal axis, similarly to existing computer tomography 

20 equipments. However, in place of sliding the subject axial ly 
through the computer tomography scanner, the second dimension 1s 
obtained by rotating either the platform on which the subject 
rests or the outer arm 73 of Figure 7 about a vertical axis. 

To spread the viewing angles uniformly, we link them to 

25 appropriate features of a regular polyhedron. Ten directions 
are obtained from the centres of half the faces of a regular 
icosahedron (20-sided polyhedron); the other ten faces would 
merely give the reciprocals of the same axes. (With the 
irradiation diverging - typically by *10° - ±15° - views from 

30 reciprocal directions are not fully identical. However, the 
views from the reciprocal hemisphere contribute little 
additional information.) 

For 30 views, we place three viewing points near the corners 
of each relevant facet 81-85, spaced equally both from each 

35 other and from the nearest points on the two neighbouring 
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facets. (Figure 8) Similarly, 3 points per face of an 
octahedron yields 12, and 4 per face <3 near the corners and 1 
in the centre) 16 viewing angles. The mid-points of the edges 
of a dodecahedron yields 15 viewing axes. In a cube, 4 points 
5 per face yields 12, and 5 per face 15 axes. In a tetrahedron, 4 
points per face would yield 8 axes. 

Rapid, very high-precision, two-dimensional mechanical 
rotation of viewing angles, round a human body stretched out at 
full length Cor indeed relative to any other subject) would 
10 involve expensive equipment. Moreover, such mechanical movement 
could be psychologically disturbing to a patient. The movement 
would have to be sufficiently rapid that the patient can hold 
his/her breath and keep immobile during the relevant interval. 
Some residual movement, due to heartbeats, would be inevitable. 
15 Since our system requires only, say, ten views, these views are 
preferably provided by means of ten fixed X-^ray sources and 
sensor arrays- The sources could however be excited in rapid 
succession, rather than truly simultaneously, to avoid 
side-scatter excitation from one source generating 'noise 1 In 
20 the sensor array for another source. (Groups of suitably 
separated sources could be activated concurrently.) 

With nearest-neighbour separations of viewing axes of 
approximately 45°, some of the viewing axes, in the upper, 
hemisphere, will have to be within 22.5 b of the horizon. Figure 
25 9 uses the specific example of a source 91 at 20° elevation, 
with an angular spread of *15°, to illustrate how sources can be 
positioned in the upper angular hemispheres to avoid obstruction 
of their radiation by the sensor arrays in the lower 
hemisphere. If the sensors are to face exactly towards the 
30 relevant source, each source must have its own sensor array. 
This is quite feasible, without mutual obstruction by these 
arrays, provided 2S, the angular spread of the angle of 
irradiation from source to sensor array, is less than G, the 
minimum angular separation between viewing axes (Figure 10). 
35 Small values of S require large squrce-to-subject spacings. 
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This may be mechanically unwelcome, but 1t does avoid any 
claustrophobia by the patient. 

The configuration of sources and sensors 1s preferably 
matched to the largest volume needing to be viewed 
5 simultaneously: as a (somewhat extreme) upper limit, say, an 
ellipsoid of 90cm (length) by 60cm (width) by 40cm (depth). For 
smaller volumes, the spread of the source beams can be 
restricted by suitable 1rises : . The dominant length dimension 
determines the source-to-subject and subject-to-sensor 

10 spadngs. The biggest source-to-subject spadngs could however 
be roughly halved, 1f, of our ten views, those substantially 
normal to the length dimension (say 4-6 views) were covered by 
2 distinct (sequentially excited) sources each, and by two 
half-length sensor screens. (The subject-to-sensor distances 

15 are however determined by the viewing angle S, and so are 
unchanged.) This configuration also reduces the ratio of 
ma jor-to-mi nor axes in the source Irradiation angles, and would 
reduce the side-scatter "noise" In the resulting picture. It 
may also be possible to narrow the source X-ray beams in one 

20 angular dimension and then to step scan them to restore their 
angular cover, thus permitting a further reduction in 
side-scatter noise. The sizes of the arrays would be matched to 
the projected views of the ellipsoid. Assuming a resolution of 
2000 angular cells (anxels) for the 90cm dimension, the average 

25 view would require approximately (ir/4)(1000/90) 2 (90x60x40) 2/3 
angular cells, i.e., for ten views, a total of 14 million 
resolution cells to be separately read, e.g. from a 
photo-stimu Table storage phosphor (like the "Fuji plate"). In 
this latter case the Interogating laser beam should be In line 

30 with the primary radiation beam. It will, furthermore, be 
necessary to have a separate plate for each viewing location. 

The separation of the sensor screens, for distinct views, 
requires large separations between the subject and the sensor 
screens S x in Figure 10. It is desirable to use a sensor screen 

35 closer to the subject. This permits a smaller screen, needing 
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fewer resolvable elements, to be used repeatedly, for several 
views, by time-sharing the excitation of the relevant sources. 
This does however, require the sensor screen to be read and 
reset very rapidly, and 1t requires the screen elements to 
5 accept radiation over an appropriate range of angles. in 
Figure 10 shows the limiting contour for a two-fold overlap of 
angles in one dimension. Thus the right-hand boundary of view A 
and the left-hand boundary of view C cross on the centre-line of 
view B. Hence the element at that intersection would have to 
10 accept radiation over the limits of ±(6-6). However, if the 
subject-to-sensor distance is marginally Increased, the 
acceptance tolerance, for all elements, 1s only ±c = ±(0-S)/2. 
For ten views, 6 « 45 a and S « 15°, as assumed in Figure 10, 
this entails c « 15°* 
15 A small part of the screen might even be common to three 
views, differing In two angular dimensions. However, as a rough 
estimate, 80% of elements will serve two views and 20% only one 
view. This corresponds to a 40% saving, giving a total of 
roughly 8.5 million resolvable elements on the screen. These 
20 elements can form a common sensor screen, covering the full 
solid angle of 2-rf in a continuously-curved surface. This surface 
does not need to be hemispherical, but may be suitably matched 
to the shape of the limiting ellipsoid. With time-sharing where 
necessary, subsets of this screen can cater for any number of 
25 views or any change of viewing angles, given the appropriate 
number of additional sources or source positions. 

Smooth shapes with a monotonic change in tangent angle will 
establish common points at approximately equal increments of 
tangent angle. Interaction with 9 other views will establish 9 
30 pairs of common points, corresponding to angular spacings of 
20°. Any inflexions increase the total number of tangent 
points. Considering each segment of an inflexion as if 1t were 
part of a raonotonically curved contour shows we maintain the 
condition of one point per 20° of positive or negative angular 
35 change. However, a sharp corner, will generate one additional 
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singular point, even though It may represent a much larger 
angular change than 20°. (Similarly, 30 views will produce 
reference points at approximately 6° intervals.) Since the 
points defined on the objects will be substantially evenly 
5 spread, 1n tangent angles (not In linear separation), see Figure 
11, they cluster closely 1n highly-curved regions, and are 
widely spread in smooth, flat parts of the surface: the optimum 
distribution of a given number of points. Furthermore, each 
oblect generates its own set of points and their distribution 

10 automatically adapts to the size of the object . However, for 
small objects, the accuracy is ultimately limited by the "grain" 
of the sensor array. 

The n(n-l) surface points are interconnected by a network of 
2n(n-l) line segments whose three-dimensional track we can 

15 infer, with high accuracy, from their known end-points and their 
known projected contour. The first outline drawn on a closed 
surface divides it into two regions. Each subsequent outline 
crosses each pre-existing one twice, and so adds two mesh-cells 
per pre-existing outline. Hence this network will define the 

20 surface as a set of n(n-l)+2 mesh cells 121. (Figure 12). Most 
of these cells will be bounded by 4 line segments, but some will 
be triangular and some may have 5 or more sides. However, in 
all cases, two projected contours intersect at each vertex, so 
forming four quadrants. Hence these cells may be found as 

25 follows:- 

1. Select a vertex where less than all 4 associated 
quadrants form parts of already established mesh cells. 

2. Select one of the associated line segments where not 
more than one side forms part of any already 

30 established mesh cell. 

3. At the other end of this segment, select one of the two 
line segments of the intersecting contour, where the 
quadrant defined jointly by this and the original line 
segments does not form parts of any already established 

35 mesh eel 1 . 
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4. At the other end of this further line segment, select 
that one of the two line segments of the intersecting 
contour, whose end point is nearest the starting vertex. 
5- Repeat step 4, for consecutive further line segments, 
5 until the starting vertex is regained. 

The sides of the cells so formed will generally be curved 
and, in a cell of four or more sides,, the corners will not 
normally be coplanar. We have devised algorithms for generating 
a curved surface, fitting these conditions. However, we prefer 
10 to represent each curved line segment by three straight lines, 
in the following manner. 

Assuming the curved segment is equivalent to a circular arc, 
the optimum 3-line approximation is obtained as Illustrated in 
Figure 13:- 

15 * Let the straight line of length L, joining A and B, the 
end-points of the line segment, be the X axis, with A 
as the origin. 

* Find points P and Q on the curve, corresponding to 
X«0.3L and X-0.7L respectively, and construct the line 

20 PQ. 

* At X « 0,5L, find E, the Y-difference between line PQ 
and the curved line segment. 

* Form line P f Q r , parallel to PQ but displaced by E/2 in 
the Y direction. 

25 * The 3-line approximation is then AP'Q'B. 

A simple analysis shows that, for a circular arc, this 
representation reduces the peak and mean positional errors 
16-fold, compared with the straight line AB. The reduction in 
the peak gradient error is only 2; 7-fold, but realistic surface 

30 gradients may be restored, following all other image 
manipulation operations, by appropriate smoothing. 

The true positions of A 1 and B 1 , the end-points of any given 
line segment, are known in three-dimensions since, by 
definition, they are common to two distinct views. However, the 

35 position of the two interpolated points, P' and Q 1 (Figure 13) 
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is only defined by one locus-line each, normal to the projection 
plane. The safest assumption 1s then that which 

* Is the minimum departure from A'B*, the straight line 
between the known end-points of the segment, 

5 * Is certain to yield a better representation than the 
single 1 ine A'B" . 

* gives the mean of all possible positions. 

These conditions are met if P' and Q' are projected on to 
the plane, through A 1 B 1 , normal to the plane A'B'BA. (Figure 
10 14) 

When one contour is intersected by 9 others, 18 common 
points are generated, separated by 20°. The centre section of 
the 3-line approximation occupies 40% of this, i.e. 8° (and the 
two end sections are of equivalent accuracy). Since the 

15 approximation error is equally distributed on either side of the 
straight line, the peak error, as a fraction of the local radius 
of curvature, Is 

e = (1- cos 4°)/2 » 1.22 x 10~ 3 . 
For 30 views and the equivalent error would be 1.1 x 10~ 4 This 

20 is an unrealistic figure, but, with single straight lines 
directly joining the endpoints of the curved segments, it is 
reduced to 1.47xl0~ 3 similar to e above. This roughly matches 
the limiting accuracy of practical sensor systems. 

The worst-case measurement error arises when two projection 

25 lines Intersect at the minimum angle, i.e. 45°, at the maximum 
distance from the sensor surface D. An angular error e, in the 
orientation of the source, would then result in a positional 
misalignment of D e sec45°. Hence a precision of 10" 3 D would 
entail e £ 0.04°, probably roughly the limit of the likely 

30 mechanical alignment of practical commercial systems. 

Thus we have converted the circumference of an N-sided mesh 
cell into a non-coplanar polygon of 3N sides. We now represent 
its surface by the most appropriate set of 3N-2 triangular 
facets, by constructing a quasi-optimum set of 3N-3 diagonals 

35 between appropriate points on this circumference:- 
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Since all such diagonals should link points on disparate 
3-line approximations, none can start at a junctions between 
disparate approximations. Hence these N junctions are 'cut off 
by diagonals linking the nearest points on either side, 
5 producing an inner mesh of 2N corners and sides- We then 'cut 
off that corner of this inner mesh requiring the shortest 
diagonal, so reducing the order of the residual mesh by 1, and 
repeat this process until, finally, a quadrilateral is reduced 
to two triangular facets 151,153. (Figure 15) 

10 The four points are the basic mesh nodes; the other eight 
points arise from the three-line approximations. Thus an 
originally four-sided mesh is represented by ten, and a 
three-sided one by seven triangular facets. This representation 
yields dramatic savings in computation for image manipulations, 

15 such as rotation or slicing. Moreover, on completion of all 
such operations, we can smooth the resulting surface, and so 
obtain an excellent reproduction of the continuously-curved real 
surface. 

Our 'objects', may well be less than physiological 
20 entities. Our mechanism creates a distinct additional 'object' 
where-ever boundaries are created, by any of the following: 
(Figure 16) 

1. A maximum of the modulus of the density gradient 161. 

2. A discontinuity in texture type 163. 

25 3. A discontinuity in a texture's orientation 165. 

4. A discontinuity in a texture's lateral alignment 167. 

5. In a proposed extension of the technique:- a 'watershed' in 
the blood-supply to adjacent and otherwise possibly 
Indistinguishable regions of tissue. 

30 Item 1. implies that an abrupt excrescence or cavity 1n a 
single functional entity would split it Into two distinct 
•objects', in each projected view, and hence 1n the derived 
three-dimensional representation. However, the data analysis 
will note the common interface. Furthermore * in an excrescence 

35 it will note the common specific density, and 1n a cavity the 
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reduction of the specific density to that of the filling medium, 
if any. (In many cases, the cavity may In fact be treated, as a 
projection from an adjacent physiological entity.) In all 
instances, there is sufficient Information for associating 
5 multiple constituent 'objects' to form the corresponding 
physiological entitles. The mechanisms listed above will 
normally generate a rather fuzzy and 'noisy 1 representation of 
an object's edge. In order to obtain a best-estimate sharp 
outline, and to condense the data into an amenable format, we 

10 convert the Input data Into a concatenation of 'least mean 
square error' second-order polynomials. 

The span length of these polynomials is adaptively adjusted, 
In constant-ratio steps, to keep the error within specified 
upper and lower limits. The starting point of each polynomial 

15 is the end of the previous one, and the end-point is the mean of 
the 'raw' edge points within a specified limit of the postulated 
span length from that start. The process is automatic, but with 
provision for optional operator intervention. 

In practice we are faced with multiple entities. Each of 

20 these generates its own Independent set of points and 
surface-mesh network, matched to its own size, shape atid 
structural complexity. However, two or more such entities 171, 
173, 175 are likely to overlap, as, illustrated in Figure 17. Ke 
have therefore devised a method for resolving such compound 

25 views into the constituent objects, using criteria such as 
continuity of edge direction and/or of edge characteristics, and 
continuity of local density and /or of density gradient, within 
each constituent object. 

As with a single object, the first step Is to find all edges 

30 and to represent them as chains of 2nd order polynomials. There 
are three versions 181-185 of T-junctions between such contours, 
see Figure 18, and three variants 191-195 of cross-overs, see 
Figure 19. (Variant 195 can arise In any of four orientations.) 
The principal tool for resolving cross-over ambiguities is a 

35 local density comparison. 
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The points being compared should lie on parallels to the 
relevant edges, to be on probable equidensity contours, but not 
too near the edge, where the density gradient is maximum and 
measurements would be error-prone. For convenience, we displace 
5 the edge segment a suitable spacing d normally to its tangent at 
the cross-over point. (Four distinct values of d could be used: 
two sides of two boundaries.) 

Although the edges are unlikely to be straight or to cross 
at right angles, we shall here, for ease of notation, treat them 
10 as the X and Y axes, with the origin at their intersection. 
Thus we determine the following densities (See Figure 20) 
D r , in quadrant 1 at point +d, +d; 
D 2 , in quadrant 2 at point -d, +d; 
D 3 , in quadrant i at point -d, -d; 
15 D 4 , in quadrant 4 at point +d, -d; 

Then, if D x - D 4 - D 2 - D 3 - &Dy, the X axis is the lower 
edge of an object of local density ADy (at distance +d from that 
. edge). Similarly, if D x - D 2 - D 4 -D 3 = AD X , the y axis 1s the 
left-hand edge of an object of local density AD X (at distance +d 
20 from that edge). Thus we proceed from 'initial objects 1 , 
defined by single closed boundary loops to 'intermediate 
objects', corresponding to multiple adjoining closed loops. 
Even these can however be the result of overlaps between a 
smaller number or larger true 'final objects'. The resolution 
25 of multiple overlaps can be simplified by working outwards from 
those 'final objects ' which have already been positively 
identified. 

This overlap resolution process is performed in each of the 
ten or more views. The common identity of a given object is 

30 maintained, from view to view, by noting the common locations of 
its extremities (and of other key features), as projected on to 
the axis common to a pair of views. The same process can also 
be used to resolve any doubt, when analysing potential overlaps 
in a specific view, e.g. to establish whether a high-density 

35 region on one view is a small object in its own right, or is the 
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result of an overlap between too larger objects. If, 
exceptionally 0(p,l), I.e. object p 1n view 1, cannot be 
directly Identified with 0<p,2), 1t should certainly be 
Identifiable via one or more intermediate views: 
5 0(p f l) ->'0(p,x> + O(p.y) -> 0(p,2>. 
When two projection planes are at right angles, the identity of 
a given object in the two views, inferred from common 
end-points, cannot be confirmed by a comparison of shapes except 
via a suitable intermediate view. For the time being, we allow 

10 for some optional operator intervention In overlap resolution. 

In our starting assumption, we postulate that each object 
has a consistent specific X-ray density, S. Hence the aggregate 
density, as seen at a point P, projected in direction e (Figure 
21) is the product of S and d, the depth of penetration of the 

15 relevant ray. If this ray does not penetrate any additional 
objects of unknown density, we obtain S directly. If the ray 
penetrates m objects 221 of unknown S but known d, (Figure 22) 
the aggregate density 1s 

D-EdjSj. (l£i£m> 

20 Hence m such equations suffice to define all m specific 
densities. 

The position of the sampling rays 1s determined, primarily, 
from superimposed outlines in the plane normal to the proposed 
ray direction. It is chosen, by inspection or analysis, so 
25 that:- 

* Each object is penetrated by at least one ray; 

* Each ray penetrates as few objects as possible; . 

* Each ray is as far as possible from the edge of any 
object penetrated by it. 

30 * When many rays are required, these are distributed 
substantially uniformly between all the available 
source views. 

* Multiple rays within a given view are as widely 
separated as the other criteria permit. 

35 Preferably also 
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* The smallest object penetration depth associated with 
each ray Is as large as possible; 

* The penetration entry and exit angles are as nearly as 
possible normal to the relevant surfaces. 

5 Normally m simple equations are sufficient, since extreme 
precision in the estimate of specific X-ray density is rarely 
required- However, we may improve the signal /noise ratio, by 
averaging the densities of a small number of adjacent pixels 
and/or undertake some redundant sampling at distinct locations. 
10 A simple 'rule of thumb 1 , for redundant Input equations, 
would be:- 

* Find the equation with the smallest object penetration 
dj and add It to the equation with the next smallest 
dj, for the same object 1. 

15 * Repeat the process until the matrix has been reduced to 
m non-redundant equations. 

Alternatively, standard techniques for solving redundant 
matrices produce 'least-mean squared 1 error results, enhancing 
20 both accuracy and stability. Furthermore, the variance so 
derived indicates which objects should be treated as 'variable 
density', and the covariance identifies mutual dependencies of 
density values. 

Ideally the error liability of each term in the redundant 
25 matrix could be deduced, from the departure of the associated 
surface entry and exit penetration angles from the normal , and 
the combined estimate could then be weighted accordingly. 
(Figure 23) However, this refinement is unlikely to be 
warranted, tn practice. 
30 The triangular facets penetrated by a sampling /ray' may be 
identified as follows :- 

Make the projection ray the Z axis. From the signs of X and 
Y check that one corner, say A, is the only one in its 
quadrant, and that at least one of the other two corners, B 
35 and C, is in the diagonally opposite quadrant. This Is 



WO 93/09512 



PCI7GB92/01969 



- 23 - 

excellent for short-listing. However, to guarantee that the 
resultant triangles Include the origin 0, requires the 
further condition that corners B and C be on opposite sides 
of line AO. 

5 Once we know the specific densities, we can compute density 
profiles: 

* for any desired direction of view, 

* after any proposed selection of objects 

* after slicing to remove undeslred parts of these 
10 objects. 

The projected vertices form Ideal sampling points, since they 
match the underlying information, and the closeness of their 
projected spacing matches the steepness of the corresponding 
surface gradient. 

15 The contribution to an object from sampling points on its 
outline will be explicitly zero. We can trace the outline as a 
continuous path, without involving irrelevant vertices, as 
follows:- 

* Find an initial point on the outline of the object by 
20 looking for the vertex with the smallest (or largest) X 

(or Y) . 

* From that vertex, (and subsequent edge vertices) find 
the edge that leads to another vertex at which there is 
zero depth of penetration through the object, 

25 * Finish when the initial vertex 1s regained. 

Any smoothing across object boundaries would blur the 
outlines, However the density profiles can be refined by 
smoothing, within the projected outlines of the relevant 
objects, e.g, as follows. 
30 * Construct a tentative set of appropriate square 
macro-pixels, 

When a macro-pixel includes one vertex, assign it the 
density of that vertex. 
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* When it includes several vertices, subdivide this 
macro-pixel in binary steps until none of the reduced 
pixels contains more than one vertex. 

* Macro-pixels not including a vertex derive their 
5 denstty from the four nearest vertices. For this 

purpose, all vertices spread their influence outwards 
in successive concentric square zones 241-247, one 
macro-pixel wide (Figure 24), The zth set of zones has 
the associated weight 1/z 2 . Each relevant pixel notes 
10 the first four densities and weights reaching 1t t to 

derive its density D' -Z(D/z> 2 / I(1/z> 2 . The process 
is terminated when the need of all macro-pixels is 
satisfied 

Realistic densities facilitate comparison with conventional 
15 X-rays. However, for the best computer-generated images, the 
user may interactively ascribe any convenient densities to the 
objects, 

Density mapping is desirable mainly because of its 
familiarity from conventional X-rays. With our full 

20 specification of the three-dimensional surface of each object, 
we can easily select only wanted objects, discard those that 
obstruct the desired view or show only their outline, producing 
a transparent projection of multiple overlapping objects. 

We can then bring out the three-dimensional shape of the 

25 wanted objects e.g. by contour lines, as in a map, or visually 
by computing and displaying the pattern of light scattered from 
the object's surface, when illuminated from any selected * 
direction. 

We can distinguish objects 251,253 by giving them 
30 distinctive colours and/or filling them with distinctive 
patterns, see Figure 25. We can also 'rotate' objects, to view 
them from any desired angle, by a simple coordinate rotation of 
the relevant surface points. 

We can also view everything beyond a selected 'slicing' 
35 plane. If we make the slicing plane Z ~ 0, triangular facets 
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Intersected by that plane are identified by the fact that their 
corners, jointly, Include both positive and negative values of Z. 

Every-day experience 1s best matched by opaque projection 
where a nearer object obscures any more distant one. The 
5 relevant objects (or their unobscured parts) are normally 
represented by their outline, with appropriate distinctive 
colouring or patterns, or by their simulated surface reflectance. 

To assess obscuration, we first rotate the coordinates to 
make the direction of projection the Z-ax1s. Then we define the 

10 rectangular boxes, in X, Y, Z, which just contain the objects. 
When there are no sub-regions of the total volume common to two 
or more such boxes, we choose the box with the smallest minimum 
Z, i.e. the object with the nearest near-point, Insert that 
objects outline on the display, and mark all pixels within that 

15 outline as 'obscured 1 . We then repeat this process with the box 
of the next smallest minimum Z, but this time omit from the 
outline area any pixels already noted as 'obscured 1 . This 
process 1s continued for consecutive further boxes, deleting 
however any boxes whose X, Y dimensions are wholly within the 

20 area already obscured. We finish when all the picture area has 
been obscured, or else all boxes have made their contribution or 
been eliminated. 

If and where two (or more) boxes share a common volume, 
there Is a possibility the 'farther 1 object 261 could locally 

25 overlap the 'nearer' one 263, (I.e. that nearer at 1t? nearest 
point ). (Figure 26) We therefore find their first intersection 
(If any) of the relevant projected X, Y outlines. If, at this 
X, Y, object A has a smaller Z than object B, then A 1s in 
front, and so obscures B, in the common area, up to the next 

30 Intersection of the outlines. 

Ambiguities could however arise 1f: 

* the 'nearer' object 271 is re-entrant (I.e. folded 
over); and 

* the nearer leaf of the fold-over is smaller than the 
35 farther one 273; and 
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* the farther object 275 penetrates into the fold. 

See Figure 27. Such situations can be resolved from outline 
projections normal to the X and Y axes and the density 
profile normal to the Z axis (with provision for operator 
5 intervention). Alternatively, wherever the enclosing boxes 
overlap, we might show the outline of the object "hearer 1 at 
its nearest point normally, and . show that of the farther 
object dotted where the outlines overlap and normally 
elsewhere. • 

10 If aggregate densities were expressed by their 

logarithm, common scale factors would appear as spatial "dc" 
components, which could be filtered out or scaled down as 
desired. the "ac" components would represent density 
ratios. In the absence of noise, this normalisation would 

15 define the edges of "weak" objects just as clearly as those 
of "strong" ones. Furthermore, the edge of an object, where 
its contribution to the aggregate density declines to zero, 
would be dramatically enhanced, since its logarithm, 1f 
unmodified, goes to minus Infinity. 

20 In practice we want to: 

* avoid negative densities 

* avoid Infinite values 

* maintain the same dynamic range of values as with the 
linear scale. 

25 If the linear dynamic range was 1 to 1000, and the 
logarithms are to base 2, we would therefore represent a linear 
density D by 

D* = 100 log 2 (l+D>. 
This form of non-linear amplitude scaling does not merely 
30 help edge extraction, 1t commonly also yields a more useful 
display. A flexible compromise density would be: 
D" = a D* + (l-a)D. 

A subset of objects may be Identified, from the variance of 
their initial, single density value, as being of variable 
35 density. These objects (or at least the sub-subset designated 
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by the operator as of diagnostic importance or others, where the 
coyarlance shows that they affect objects of diagnostic 
Importance), can be treated as an array of cellular subobjects 
with Independent cell-densities. We assume that the density 

5 variations are relatively smooth, since any abrupt discontinuity 
would have generated an ' Interobject 1 boundary. Thus the 
resolution for mapping w1 thin-object density variations can be 
substantially coarser than that defining the object's surface. 
Hence we characterise the relevant objects by a parameter k 

10 which, roughly, represents the number of distinguishable density 
steps per linear dimenslon:- 

* Let the object 281 be contained in AX, Ay, AZ, where 
AX.AY.AZ = L 3 . 

Divide the 'box' AX, AY, AZ Into k 3 cubical cells of side 
15 L/k. The object will then occupy a little less than k 3 cells, 
the interior ones cubical, but the peripheral ones bounded by 
the surface of the object. (Figure 28) Each object would be 
assigned an appropriate k, say 5 £ k £ 25. Typically k«=10, 
yielding «1000 cells with potentially distinct densities, stored 
20 as one word each. The cell locations within the 'box' would be 
Identified by their position within the block of k 3 storage 
locations, and absolute position would be defined by fixing one 
corner of the box. Hence a variable-density object with 
1000-cell boxes would only increase by 10% the total storage 
25 requirement for a fixed-density objects, with ten views (See 
analysis below and Table 8). The regular array of cubical cells 
is very convenient for creating views parallel to any desired 
axis. 

Each view of 1,000 by 1,000 pixels provides 10 6 independent 
30 data points. Hence a total of 10 such views could resolve up to 
3xl0 7 independent cell densities. This corresponds to the 
limits detailed 1n Table 4:- 

Thus the potential input information represents no 
significant restriction on the variable-density objects. Indeed, 
35 there should normally be substantial scope for averaging over 
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groups of associated input pixels, and for taking redundant 
Independent density samples, In order to enhance the 
signal /noise ratio in determining the Individual cell densities. 
When the relevant 'objects 1 are embedded within some form of 
5 background tissue, this has Its own boundary, and simply 
constitutes an additional object. 

Any points, lines, or other small-scale features which do 
not qualify as 'objects 1 , should add little to the aggregate 
density integrated over a ray. Their effect will be further 

10 diminished by the weighted averaging of density values derived 
from several rays. At worst, these features produce a small 
error in the computed specific densities of the objects. We can 
then view and manipulate the objects as before, independently of 
the non-object-like matter. 

15 If, however, we want to view the non-object-Uke matter, we 
first compute the pbject-related density profile in the 
direction of anv of the source views. We then subtract this 
from the source-view Intensity pattern, so that any small-scale 
features (in front of, behind, or Indeed within the objects) are 

20 no longer obscured by those objects. Since some of these 
non-object-like features may to relate to the objects, we may 
include the objects' outlines 1n these synthetic pictures. (In 
a borderline situation, features could be too small and numerous 
to be handled as objects, but too large to be treated as 

25 non-object-Uke.) 

In conventional or wide-fan CT, all relevant material is 
exposed to (typically) 1,000 separate Irradiations. This 
constrasts with, say, ten for the object-based scheme - a 
100-fold reduction. In addition, however, there is an equally 

30 dramatic reduction in the source intensities required. This 
arises because object boundaries have to be identified, rather 
than Individual voxels. Each boundary point benefits then from:- 
* Continuity over a bracket of at least (say) 5 boundary 
points; 
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* A progressive density trend, observed over a strip at 
least 5 pixels wide. Inward from this boundary; 

* Integration In the depth dimension, for each pixel, 
over a column of an average height of at least 5 voxels. 

5 Jointly these three factors provide a 125-fold (minimum) 
signal/noise ratio enhancement, I.e. 21dB. Indeed, contrasting 
the Intensities used for normal X-rays (about 0.05 units) with 
those of the Individual computer tomography irradiations 
(typically 100 units) suggests that this estimate of the 

10 reduction achievable 1s very conservative - or else that 
computer edge-detection is some 16-fold inferior to the 
capability of the human eye. 

Thus, combining the number of exposures and their Intensities, 
the total effective exposure 1s reduced more than 10 4 fold. 

15 A wide-fan computer tomography system, with a 10 6 -p1xel 
slice-plane resolution, a slice width of 20 voxels, and a total 
of ten slices, would have a total Input of 2x10 s density words. 
A standard computer tomography system of 40 slices of lO'-plxels 
each would have a total input of 4x1 0 7 density words. 

20 An object-based system with 10 views of l0 6 -p1xel each would 
have a total input of 10 7 density words. 

(But the less stringent signal/noise requirements would permit 
the use of a shorter word.) 

In the two computer tomography systems, the dominant 

25 computational load is that of resolving the densities of the 
constituent voxels or- pixels, respectively, and so is directly 
proportional to the input data, above. By contrast, In the 
object-based system, the computation is mainly dependent on the 
number of primary surface points generated (I.e. excluding the 

30 3-11 ne approximations). This Is estimated below to be 400 
points per object. These computations have of course to be 
weighted by distinct scale factors:- For the computer tomography 
systems the quantity of computation per voxel is greater, but 
for the object-based scheme the complexity of the computation is 
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greater, and so it Is less suitable for a specialised array 
processor. For the present, we ignore these differences. 

The wide-fan volume was postulated to comprise V « 2xl0 8 
voxels. If this Is in fact made up of n objects, their average 
5 linear dimensions will be d*»(V/n->*. If we now reduced the 
stored data to those voxels lying on the surface of objects, the 
number of voxels to be stored would be reduced by the 
surface-to- volume ratio of a sphere, i.e. 4x3mr 2 /4irr 3 « r/3 = 
d/6. Making some allowance for objects of more complex shape, 

10 d*/10 is a more realistic figure, resulting in lOV^n 54 surface 
voxels. However, we would then have to note the position of 
each such voxel in three co-ordinates, in addition to its 
density. (Previously position could be implied by storing all 
voxels in a predetermined pattern.) Hence the storage 

15 requirement Is only reduced in the ratio 40/(V/n) x . Standard 
computer tomography has less input information and very crude 
information tn the axial dimension Cand accordingly fewer but 
larger voxels). The ratio of edge pixels to total pixels within 
a slice is then the same as the voxel ratio above. In this 

20 case, however, each pixel needs only two position co-ordinates 
fn addition to the density, and so reduction factor is then 
SO/CV/n)* 4 . 

In the object-based scheme, with ten views, each object 
involves, 90 contour intersection points per monotonlcally 

25 curved object surface. Making a generous allowance for possible 
complex re-entrant shapes might increase this total to, say, 400 
points - plus a further 800 points arising from the three-line 
approximations, all associated with three position 
co-ordinates. In addition, let us allow for an average of four 

30 pointers linking these points to the storage addresses of other 
points and, for the contour Intersections only, four pointers to 
the associated mesh cells. (The various pointers, whilst not 
essential, can save much subsequent computation.) This gives a 
total of 10 4 n words. (On the same basis ten views of n 
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objects, with the nodes joined by single straight lines, would 
require 4x1 0 4 n words.) 

The above 'comparisons are collated in Tables 5-8. It 
will be noted that 'Object 3D 1 always yields a quite dramatic 
5 saving 1n radiation exposure, and a very substantial saving in 
Input data and in computation. Even when the voxel data are 
reduced to surfaces only, for reasonable numbers of objects; 
object 3D yields a dramatically more compact and efficient data 
representation. Hence, compared with the alternatives, it 

10 requires only minimal computer storage or processing power for 
inter-active, friendly user facilities. It benefits further, 
because the object concept matches the user's own perception, 
because of the computational efficiency of pointer-linked data 
structures, and because of the Intrinsic simplicity of our 

15 surface representation :- 

* Rotation only entails coordinate conversion for a set 
of 3D points. 

* Slicing merely involves finding the set of straight 
lines cut by the slicing plane. 

20 * An object's outline, in any direction of view, merely 
consists of the relevant set of edges. 
(An outline can never lie within a triangular facet.) 
However, for 1000 or more objects, most of this advantage in the 
storage and manipulations of processed data is lost. 
25 Its relatively modest data and computational 

requirements and its user-friendliness could make the 
object-based technique suitable for a substantially expanded 
market of existing and new applications, 

* in medicine 

30 * in manufacturing industry, for quality-control, 
Inspection (including comparison with a computer-stored 
reference), and metrology), 

* for customs-examination and securi ty-rinspection at 
ports, airports and sensitise sites. 



WO 93/09512 



PCT/GB92/01969 



- 32 - - 

The scheme need not be limited to X-rays: 1t can be applied 
to any transmission-type imaging systems, such as: infra-red or 
neutron radiation, and positron emission. Furthermore, 
conventional X-ray tomography, computer tomography, acoustic 
5 imaging, and magnetic-resonance imaging all generate their 
information as a stack of cross-sectional slices- On each of 
these slices, edges could be detected and objects outlined, as 
in our process. These edges can then be defined as a succession 
of straight-line segments, and the corners of a given object, in 
10 consecutive slices, can be joined to create triangular facets, 
as Illustrated in the next section. (The objects 1 identity can 
easily be tracked from slice to slice.) 

Thus these other sensor modalities can use - and benefit from - 
the same software, the same economy in picture storage, 
15 manipulation and transmission, and the same object-based user 
facilities - even though they do not benefit from our economy 
and optimisation in data acquisition and minimisation of sensor 
exposure. In addition to software, hardware, and user-interface 
standardisation, this say permit the synergistic combination of 
20 multi -sensor data. 

The requirement to interpolate between slices to generate 
•best-estimate 1 three-dimensional objects is common to all 
slice-based three-dimensional data acquisition systems, and a 
number of these use triangular facets for surface 
25 representation. This requirement could be met, in a manner 
consistent with the three-dimensional object-based imaging 
scheme, as follows :- 

In each slice, detect the edges of objects. 
2. Represent each edge as a succession of polynomials. 
30 3. Represent each polynomial segment by a three-line 
approximation, thus generating two additional 'nodes'. 
4. Identify cross sections of the same object, in 
successive slices, by their correlation. 
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5. For each distinct object, and each relevant pair of 
consecutive slices, form a set of triangular 
surface-facets, as defined In steps 6 - 11. 

6. Start at an arbitrary node, N(L t l), on one of these 
5 slices, (say L, the lower one of a pair) 

7. Find N(U,1), the nearest node in the other slice. (In 
this case, slice U, i.e. the upper one) 

8. Test whether this link is also the shortest one from 
N(U,1) to any node in slice L. 

10 9. If NO, move to the next node clockwise from N(L,1) and 
then return to step 7. 
10. If YES, find the shorter of the links N(L,1) + N(U,2) 
and N(U,1) V N(L,2), where In each case 2 denotes the 
next node clockwise, 
15 11. Select that link and, in relation to it . return to step 
10. 12. When, eventually, the link so found 1s again 
the Initial link first found In step 10, END. 
Initial experiments used single computer-generated 
ellipsoids of revolution, and computed their projected outline 
20 in specified directions of view. They then derived the points 
common to pairs of outlines and joined them up in single 
straight lines (rather than three-line approximations to the 
relevant arc). This proved the principal software modules. 

Next we used multiple, computer-generated ellipsoids of 
25 revolution, differing In size, length-to-diameter ratio and 
three-dimensional position, in order to prove our 
overlap-resolution procedure. 

We then repeated the initial experiment, with ten suitably 
distributed outlines of the ellipsoid, and fitted the 
30 appropriate three-line approximations to the resulting contour 
segments, to confirm the accuracy of the reconstituted surface. 

We also acquired a plastic replica bone (as sold for pet 
dogs). For simplicity, we embedded this bone 1n a foam-plastic 
sphere, which could be placed, in any desired orientation, on a 
35 foam-plastic annulus between the source and plate of a 
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conventional X-ray machine. Even with only five views, this 
produced quite realistic-looking three-dimensional 
reconstructions, but the accuracy was limited, because the foam 
plastic sphere did not keep its shape, and because the X-ray 
5 apparatus had no provision for precise angular alignment. (A 1° 
alignment error could result in a positional error of up to 2.5% 
of the X-ray-source-to-plate spacing.) This scheme entailed 
also errors in lateral alignment, both in the initial position 
of the X-ray plate and in its subsequent scanning into the 
10 computer. i 

Hence three specific, mutually orthogonal views were aligned 
with each other, on their common axes, and were then designated 
as 'reference masters 1 . The common^axts projections of all 
other views were then aligned with those masters. This produced 
15 the computed pictures shown in Figures. 29-36. Figures 29-32 
show four different views of Ca) the raw X-rays of a nylon bone 
<b) the edges extracted from (a) and (c) the multiple-polynomial 
graphical representation of Cb>. Figures 33-36 show (a) 3D wire 
frames of the nylon bone at different orientations reconstructed 
20 from five raw X-rays and (b) renedered views of (a). 

Next we arranged to cast a higher-density copy of the same 
1 dog-bone 1 which could he mounted Inside a rigid plastic sphere, 
giving adequate X-ray contrast relative to the spherical shell. 
For the sphere we chose a geographic globe, since this was 
25 already marked with appropriate angular co-ordinates. We then 
attached an aiming telescope to the X-ray apparatus, for better 
angular alignment. 

Other experiments proved the extraction of specific density 
and the reconstruction of computed density profiles for single 
30 and multiple fixed and variable density objects. Yet others 
applied angiography (dye injection into the blood supply) to 
provide enhanced images of the human kidney for object-based 
three-dimensional imaging. 
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Table 1. Characteristics of ideal CT system 

Resolution 1 ,000 lines 

Approx. No. of pixels 1,000,000 

No. of dots per line view 1 ,000 

No. of pixels Integrated 1n each dot 1,000 

No. of views to define all pixels 1,000 



10 



Table 2. True 3D CT voxel-based system 

15 Resolution 1,000 lines 
Approx. No. Of voxels 1,000,000,000 

No. of pixels per view 1,000,000 

No. of voxels Integrated In each pixel 1,000 

No- of views to define all voxels 1,000 

20 



Table 3. Common 3D points from multiple 2D views 

25 min No. of 3D points per pairing of views 2 

mln No. of 3D points for n views n.(n.l) 

min No. of 3D points for 10 views 90 

min No. of 3D points for 30 views 870 



30 
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Table 4 Limiting No. of variable density objects 

k Cells per box Limiting No. of objects 

5 125 240,000 

10 1,000 30,000 

25 15,600 1,900 

100 1,000,000 30 



•10 



Wide Fan CT 
Normal CT 
Object 3D 



Table 5. Comparison of Exposure. 
Relative Irradiation 
10 4 
10* 
1 



15 



20 



Wide Fan CT 
Normal CT 
Object 3D 



Table 6 Input Data 
200xl0 6 
10x1 0 6 
10x10 s 



Table 7. Comparison of Data Points and Computational Load 



Wide Fan CT 
25 Normal CT 
Object 3D 



1 



10 



1.2x10 s 1.2xl0 4 



100 
2x1 0 8 
4x1 0 7 
1.2x10 s 



1000 



1.2x10* 



30 



Table 8. 



n 



Wide Fan CT 
Normal CT 
Object 3D 



Comparison of Processed CSurface-only) Data. 

1 10 100 1000 

4x1 0 6 9x10 s 20x10 s 40xl0 6 

0.6x10 s 1.3x10 s 3x10 s 6x10 s 

10 4 10 s 10 s 10 7 
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Claims 

1. A method of generating Images of a specimen containing at 
least two regions of different opacity to the passage of 
radiation comprising Irradiating said specimen with radiation 

5 from radiation source means, detecting said radiation with 
radiation detector means sensitive to said radiation to produce 
a plurality of signals indicative of the amplitude of said 
radiation passing through said specimen In a corresponding 
plurality of different directions characterised in that said 
10 plurality of signals are processed to derive further signals 
indicative of contours of maximum local discontinuity in opacity 
or texture thereby providing an indication the position of the 
three-dimensional boundary surface between said two regions. 

2. A method of generating Images of a specimen as claimed in 
15 claim 1 characterised in that said plurality Is between four and 

a hundred. 

3. A method of generating images of a specimen as claimed in 
claim 2 characterised in that said plurality 1s between ten and 
thirty. 

20 4. A method of generating Images of a specimen as claimed In 
claim 2 characterised in that said directions are substantially 
uniformly distributed in solid angle over a hemisphere. 
5- A method of generating images of a specimen as claimed in 
claim 4 characterised in that said directions pass normally 

25 through corresponding predetermined points on the faces of a 
regular polyhedron. 

6, A method of generating images of a specimen as claimed in 
claim 1 characterised in that said plurality of signals is 
generated by a corresponding plurality of radiation sources. 

30 7. A method of generating Images of a specimen as claimed in 
claim 6 characterised in that multiple sources are actuated 
successively to inhibit side-scatter excitation. 
8. A method of generating Images of a specimen as claimed in 
any one of the preceding claims characterised in that it 

35 Includes the step of performing a local density comparison for 
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the purpose of resolving cross-over ambiguities. 
9. Apparatus for the generating of images of a specimen 
containing at least two regions of different opacity to the 
transmission of radiation comprising radiation source means for 
5 directing radiation in a plurality of directions through said 
specimen, detector means to detect the amplitude of said 
radiation after passage through said specimen In each of said 
directions, processing means to process signals produced by said 
detector means to derive signals indicative of the contours of 
10 maximum local discontinuity in opacity or texture within said 
specimen and readout means to provide a readout of said derived 
signals. 
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